Efficient and Effective Visual Codebook Generation Using Additive Kernels
نویسندگان
چکیده
Common visual codebook generation methods used in a bag of visual words model, for example, k-means or Gaussian Mixture Model, use the Euclidean distance to cluster features into visual code words. However, most popular visual descriptors are histograms of image measurements. It has been shown that with histogram features, the Histogram Intersection Kernel (HIK) is more effective than the Euclidean distance in supervised learning tasks. In this paper, we demonstrate that HIK can be used in an unsupervised manner to significantly improve the generation of visual codebooks. We propose a histogram kernel k-means algorithm which is easy to implement and runs almost as fast as the standard k-means. The HIK codebooks have consistently higher recognition accuracy over k-means codebooks by 2–4% in several benchmark object and scene recognition data sets. The algorithm is also generalized to arbitrary additive kernels. Its speed is thousands of times faster than a naive implementation of the kernel k-means algorithm. In addition, we propose a one-class SVM formulation to create more effective visual code words. Finally, we show that the standard kmedian clustering method can be used for visual codebook generation and can act as a compromise between the HIK / additive kernel and the k-means approaches.
منابع مشابه
Image Compression with Efficient Code Book Initialization Using Lbg Algorithm Image Compression with Efficient Codebook Initialization Using Lbg Algorithm
Vector quantization (VQ) has received a great attention in the field of multimedia data compression since last few decades because it has simple decoding structure and can provide high compression ratio. In general, algorithms of VQ codebook generation focus on solving two kinds of problem: (i) to determine the quantization regions and the code words that minimize the distortion error. (ii) to ...
متن کاملAn Efficient and Effective Method for VQ Codebook Design
Although there have been many methods proposed to speed up the VQ codebook generation process, it turns out to be the case that these methods either suffer from slight degradation in overall distortion or just maintain the same quality with hardly time saving. In other words, the methods proposed to generate better codebooks are often inefficient, and those which are fast tend to have worse qua...
متن کاملImage Compression with Efficient Codebook Initilization Using Lbg- Optimization Algorithm
In this paper we present a very simple and yet effective algorithm to generate good codebook. In general VQ codebook generation algorithm focus on solving two problem(i)To reduce the computational complexity of code words search the building the codebook. (ii) Extra Computational overhead to calculate the measurement of codeword in codebooks..in this paper, a novel VQ codebook generation method...
متن کاملBilevel Visual Words Coding for Image Classification
Bag-of-Words approach has played an important role in recent works for image classification. In consideration of efficiency, most methods use kmeans clustering to generate the codebook. The obtained codebooks often lose the cluster size and shape information with distortion errors and low discriminative power. Though some efforts have been made to optimize codebook in sparse coding, they usuall...
متن کاملMetric Learning in Codebook Generation of Bag-of-Words for Person Re-identification
Person re-identification is generally divided into two part: first how to represent a pedestrian by discriminative visual descriptors and second how to compare them by suitable distance metrics. Conventional methods isolate these two parts, the first part usually unsupervised and the second part supervised. The Bag-of-Words (BoW) model is a widely used image representing descriptor in part one....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Journal of Machine Learning Research
دوره 12 شماره
صفحات -
تاریخ انتشار 2011